SEQUENCE LISTING 

<110> Gellissen, Gerd 
Braus , Gerhard 
Pries, Ralph 
Krappmann, Sven 
Strasser, Alexander 

<120> Nucleic Acid Molecule Comprising a Nucleic Acid Coding for a 
Polypeptide with Chorismate Mutase Activity 

<130> 029474-5007-00 ^ 

m 

o o 

<140> 10/042059 rO 
<141> 2001-10-25 ^ pi 

<150> DE 199 19 124.7 rn rn 
<151> 1999-04-27 ^ 

<160> 7 ^ o ' 

<170> Patentln version 3.1 7^ u> 

<210> 1 

<211> 843 

<212> DNA 

<213> Hansenula polymorpha 
<220> 

<221> CDS 

<222> (1) . . (843) 

<223> 

<400> 1 

atg gac ttt atg aag cca gaa aca gtg ctg gac ctt ggc aac att aga 48 

Met Asp Phe Met Lys Pro Glu Thr Val Leu Asp Leu Gly Asn lie Arg 

15 10 15 

gat gcc ttg gtc egg atg gag gat acg ate ate ttc aac ttt ate gag 96 
Asp Ala Leu Val Arg Met Glu Asp Thr lie lie Phe Asn Phe lie Glu 
20 25 30 

egg teg cag ttc tat gcg teg ccc teg gta tac aaa gtc aac cag ttc 144 
Arg Ser Gin Phe Tyr Ala Ser Pro Ser Val Tyr Lys Val Asn Gin Phe 
35 40 45 

cct att ccc aac ttc gac ggc teg ttc ttg gac tgg ctg ttg teg cag 192 
Pro lie Pro Asn Phe Asp Gly Ser Phe Leu Asp Trp Leu Leu Ser Gin 
50 55 60 

cac gag cga ate cat teg cag gtg agg aga tac gac gcg cca gac gag 240 
His Glu Arg lie His Ser Gin Val Arg Arg Tyr Asp Ala Pro Asp Glu 
65 70 75 80 



gtg cct ttt ttc ccc aac gtg ctg gaa aaa acg ttt ctg ccc aag ate 
Val Pro Phe Phe Pro Asn Val Leu Glu Lys Thr Phe Leu Pro Lys lie 
85 90 95 



288 



aac tac cca teg gtg eta gee tec tac gcg gat gaa ate aac gtc aac 336 
Asn Tyr Pro Ser Val Leu Ala Ser Tyr Ala Asp Glu lie Asn Val Asn 
100 105 110 

aaa gag ata etc aag ate tac acg tea gag ata gta cca gga ata get 384 
Lys Glu lie Leu Lys lie Tyr Thr Ser Glu lie Val Pro Gly lie Ala 
115 120 125 i 

gca ggc age gga gag cag gag gac aac ctt ggc teg tgc gca atg gee 432 
Ala Gly Ser Gly Glu Gin Glu Asp Asn Leu Gly Ser Cys Ala Met Ala 
130 135 140 

gac ate gag tgc ctg cag teg eta tec aga aga ate cat ttt ggc cgt 480 
Asp lie Glu Cys Leu Gin Ser Leu Ser Arg Arg lie His Phe Gly Arg 
145 150 155 160 

ttt gtc gca gag get aaa ttt ate agt gag ggg gac aag att gtg gat 52 8 

Phe Val Ala Glu Ala Lys Phe lie Ser Glu Gly Asp Lys lie Val Asp 
165 170 175 

ctg ate aaa aag aga gat gtg gaa ggc att gag gcg etc ate aca aac 576 
Leu lie Lys Lys Arg Asp Val Glu Gly lie Glu Ala Leu lie Thr Asn 
180 185 190 

gec gag gtc gaa aaa egg ate ttg gac aga ctt ctg gag aag gga agg 624 
Ala Glu Val Glu Lys Arg lie Leu Asp Arg Leu Leu Glu Lys Gly Arg 
195 200 205 

gcg tat gga aca gac ccg aca eta aag ttc acg cag cac att cag age 672 
Ala Tyr Gly Thr Asp Pro Thr Leu Lys Phe Thr Gin His lie Gin Ser 
210 215 220 

aag gtg aag ccc gag gtg att gtg aaa ate tac aag gat ttc gtg att 72 0 

Lys Val Lys Pro Glu Val lie Val Lys lie Tyr Lys Asp Phe Val lie 
225 230 235 240 

ccg etc acg aag aag gtc gaa gtc gac tac ttg ctg aga egg ctg gag 768 
Pro Leu Thr Lys Lys Val Glu Val Asp Tyr Leu Leu Arg Arg Leu Glu 
245 250 255 

gac gag gag gac gat gat gcg acg cag aaa age ggc ggc tac gtt gac 816 
Asp Glu Glu Asp Asp Asp Ala Thr Gin Lys Ser Gly Gly Tyr Val Asp 
260 265 270 

egg ttt etc tec tct ggc ttg tac tag 843 
Arg Phe Leu Ser Ser Gly Leu Tyr 
275 280 



<210> 2 

<211> 280 

<212> PRT 

<213> Hansenula polymorpha 



<400> 2 



Met Asp Phe Met Lys Pro Glu Thr Val Leu Asp Leu Gly Asn lie Arg 
15 10 15 



Asp Ala Leu Val Arg Met Glu Asp Thr lie lie Phe Asn Phe lie Glu 
20 25 30 



Arg Ser Gin Phe Tyr Ala Ser Pro Ser Val Tyr Lys Val Asn Gin Phe 
35 40 45 



Pro lie Pro Asn Phe Asp Gly Ser Phe Leu Asp Trp Leu Leu Ser Gin 
50 55 - 60 



His Glu Arg lie His Ser Gin Val Arg Arg Tyr Asp Ala Pro Asp Glu 
65 70 75 80 



Val Pro Phe Phe Pro Asn Val Leu Glu Lys Thr Phe Leu Pro Lys lie 
85 90 95 



Asn Tyr Pro Ser Val Leu Ala Ser Tyr Ala Asp Glu lie Asn Val Asn 
100 105 110 



Lys Glu lie Leu Lys lie Tyr Thr Ser Glu lie Val Pro Gly lie Ala 
115 120 125 



Ala Gly Ser Gly Glu Gin Glu Asp Asn Leu Gly Ser Cys Ala Met Ala 
130 135 140 



Asp lie Glu Cys Leu Gin Ser Leu Ser Arg Arg lie His Phe Gly Arg 
145 150 155 160 



Phe Val Ala Glu Ala Lys Phe He Ser Glu Gly Asp Lys He Val Asp 
165 170 175 



Leu He Lys Lys Arg Asp Val Glu Gly He Glu Ala Leu He Thr Asn 
180 185 190 



Ala Glu Val Glu Lys Arg He Leu Asp Arg Leu Leu Glu Lys Gly Arg 
195 200 205 



Ala Tyr Gly Thr Asp Pro Thr Leu Lys Phe Thr Gin His He Gin Ser 
210 215 220 



Lys Val Lys Pro Glu Val He Val Lys He Tyr Lys Asp Phe Val He 





225 



230 



235 



240 



Pro Leu Thr 



Lys Lys Val Glu Val Asp Tyr Leu Leu Arg Arg Leu Glu 
245 250 "* 255 



Asp Glu Glu 



Asp Asp Asp Ala Thr Gin Lys Ser Gly Gly Tyr Val Asp 
260 265 270 



Arg Phe Leu 
275 



Ser Ser Gly Leu Tyr 
280 



<210> 3 

<211> 1655 

<212> DNA 

<213> Hansenula polymorpha 
<220> 

<221> gene 

<222> (1) . . (1655) 

<223> 1,8 kb genomic DNA-f ragment from Hanseula polymorpha 



<220> 

<221> gene 

<222> (1) . . (1655) 

<223> 1,8 kb genomic DNA-f ragment from Hansenula polymorpha 
<400> 3 

cccggcccaa tgccagcaat atggagacgt ttaggcagaa taggcgttcc atacttctca 60 

cgctgcttgt tgccaccgga atatacaccg cattgcagtt tgcacacatc atactatatg 120 

acgattacat tggcggaacg tatcgcgagt cgctcacgag acgcattaga atgacagaga 180 

aatcgcgaaa cgaccttata gacgcacgtg aaaactacgg gtttggaggc agcaaggagg 24 0 

agcgaatcca gcggtttttg tggttcagac atctttcgtg gcttttaggc gaggataagc 300 

gaacttgagg agcgtttttt ttttcctgtt tagtttttgt aggtatggac tttatgaagc 360 

cagaaacagt gctggacctt ggcaacatta gagatgcctt ggtccggatg gaggatacga 42 0 

tcatcttcaa ctttatcgag cggtcgcagt tctatgcgtc gccctcggta tacaaagtca 480 

accagttccc tattcccaac ttcgacggct cgttcttgga ctggctgttg tcgcagcacg 540 

agcgaatcca ttcgcaggtg aggagatacg acgcgccaga cgaggtgcct tttttcccca 600 

acgtgctgga aaaaacgttt ctgcccaaga tcaactaccc atcggtgcta gcctcctacg 660 

cggatgaaat caacgtcaac aaagagatac tcaagatcta cacgtcagag atagtaccag 72 0 





# 






• 




gaatagctgc 


aggcagcgga 


gagcaggagg acaaccttgg ctcgtgcgca 


atggccgaca 


tcgagtgcct 


gcagtcgcta 


tccagaagaa 


tccattttgg 


ccgttttgtc 


gcagaggcta 


aatttatcag 


tgagggggac 


aagattgtgg 


atctgatcaa 


aaagagagat 


gtggaaggca 


ttgaggcgct 


catcacaaac 


gccgaggtcg 


aaaaacggat 


cttggacaga 


cttctggaga 


agggaagggc 


gtatggaaca 


gacccgacac 


taaagttcac 


gcagcacatt 


cagagcaagg 


tgaagcccga 


ggtgattgtg 


aaaatctaca 


aggatttcgt 


gattccgctc 


acgaagaagg 


tcgaagtcga 


ctacttgctg 


agacggctgg 


aggacgagga 


ggacgatgat 


gcgacgcaga 


aaagcggcgg 


ctacgttgac 


cggtttctct 


cctctggctt 


gtactagaaa 


ttaaaatttt 


cagtacttta 


attattctcg 


aattctagtt 


cagataccgc 


atggtaattt 


caaaggccag 


aaaagtggcc 


gcgttggctg 


gggcagctct 


cagaatagtc 


ggcgagaatc 


ctttgactag 


cccccaggca 


ccgctctgtc 


tccaaatacc 


cctaatagtc 


tcaacagcat 


ttctataaac 


cagcttcttg 


tagttgtccg 


tctgcatgtt 


ggacttgatc 


acatcgatcg 


gataaatact 


gaaccacatc 


ccgtaacctg 


ccagcgcccc 


aaagacgcag 


agcttccagt 


tctcgatgtc 


cttcctggca 


atattccgcg 


actcgatctc 


gtttttcacg 


agagcttcaa 


aagtcagaaa 


atacgctccg 


ctacccaaac 


tttctcttgc 


cagcgtaggt 


cccagacccc 


ggtagattaa 


cttgatgcct 


cccgtatggt 


acagcttctt 


gatcc 







<210> 5 

<211> 22 

<212> DNA 

<213> Artificial 



780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1655 



<210> 4 

<211> 20 

<212> DNA 

<213> Artificial 

<220> 

< 2 2 3 > 01 igonuc 1 eo t ide 

<400> 4 

aattaaccct cactaaaggg 2 0 



<220> 

<223> Oligonucleotide 



<400> 5 

gtaatacgac tcactatagg gc 



22 




<210> 6 

<211> 26 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 

<400> 6 

atatagatct acaaaaacta aacagg 26 



<210> 7 

<211> 28 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 



<400> 7 

atatagatct gatgcgacgc agaaaagc 



28 
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Marked-up Sequence Listing 



SEQUENCE LISTING 

<110> Gellissen, Gerd 
Braus , Gerhard 
Pries, Ralph 
Krappmann, Sven 
Strasser, Alexander 

<120> Nucleic Acid Molecule Comprising a Nucleic Acid Coding for a Polypeptide 
with Chorismate Mutase Activity 

<130> 029474-5007-00 

<140> 10/042059 
<141> 2001-10-25 

<150> DE 199 19 124.7 
<151> 1999-04-27 

<160> 7 

<170> Patentln version 3.1 

<210> 1 

<211> 843 

<212> DNA 

<213> Hansenula polymorpha 
<220> 

<221> CDS 

<222> (1) . . (843) 

<223> 

<400> 1 

atg gac ttt atg aag cca gaa aca gtg ctg gac ctt ggc aac att aga 48 

Met Asp Phe Met Lys Pro Glu Thr Val Leu Asp Leu Gly Asn lie Arg 

1,5 10 15 

gat gcc ttg gtc egg atg gag gat acg ate ate ttc aac ttt ate gag 96 
Asp Ala Leu Val Arg Met Glu Asp Thr lie lie Phe Asn Phe lie Glu 

2 0 25 ,30 

egg teg cag ttc tat gcg teg ccc teg gta tac aaa gtc aac cag ttc 144 
Arg Ser Gin Phe Tyr Ala Ser Pro Ser Val Tyr Lys Val Asn Gin Phe 
35 40 45 

cct att ccc aac ttc gac ggc teg ttc ttg gac tgg ctg ttg teg cag 192 
Pro lie Pro Asn Phe Asp Gly Ser Phe Leu Asp Trp Leu Leu Ser Gin 
50 55 60 

cac gag cga ate cat teg cag gtg agg aga tac gac gcg cca gac gag 240 
His Glu Arg lie His Ser Gin Val Arg Arg Tyr Asp Ala Pro Asp Glu 
65 70 75 80 

gtg cct ttt ttc ccc aac gtg ctg gaa aaa acg ttt ctg ccc aag ate 288 
Val Pro Phe Phe Pro Asn Val Leu Glu Lys Thr Phe Leu Pro Lys lie 
85 90 95 



aac tac cca teg gtg eta gcc tec tac gcg gat gaa ate aac gtc aac 
Asn Tyr Pro Ser Val Leu Ala Ser Tyr Ala Asp Glu lie Asn Val Asn 



336 



100 105 110 

aaa gag ata etc aag ate tac acg tea gag ata gta cca gga ata get 3 84 

Lys Glu lie Leu Lys lie Tyr Thr Ser Glu lie Val Pro Gly lie Ala 
115 120 125 

gca ggc age gga gag cag gag gac aac ctt ggc teg tgc gca atg gee 4 32 

Ala Gly Ser Gly Glu Gin Glu Asp Asn Leu Gly Ser Cys Ala Met Ala 
130 * 135 140 

gac ate gag tgc ctg cag teg eta tec aga aga ate cat ttt ggc cgt 4 80 

Asp lie Glu Cys Leu Gin Ser Leu Ser Arg Arg lie His Phe Gly Arg 
145 150 155 160 

ttt gtc gca gag get aaa ttt ate agt gag ggg gac aag att gtg gat 528 
Phe Val Ala Glu Ala Lys Phe lie Ser Glu Gly Asp Lys lie Val Asp 
165 170 175 

ctg ate aaa aag aga gat gtg gaa ggc att gag gcg etc ate aca aac 57 6 

Leu lie Lys Lys Arg Asp Val Glu Gly lie Glu Ala Leu lie Thr Asn 
180 185 190 

gee gag gtc gaa aaa egg ate ttg gac aga ctt ctg gag aag gga agg 624 
Ala Glu Val Glu Lys Arg lie Leu Asp Arg Leu Leu Glu Lys Gly Arg 
195 200 205 

gcg tat gga aca gac ccg aca eta aag ttc acg cag cac att cag age 672 
Ala Tyr Gly Thr Asp Pro Thr Leu Lys Phe Thr Gin His lie Gin Ser 
210 215 220 

aag gtg aag ccc gag gtg att gtg aaa ate tac aag gat ttc gtg att 72 0 

Lys Val Lys Pro Glu Val lie Val Lys lie Tyr Lys Asp Phe Val He 
225 230 235 240 

ccg etc acg aag aag gtc gaa gtc gac tac ttg ctg aga egg ctg gag 768 
Pro Leu Thr Lys Lys Val Glu Val Asp Tyr Leu Leu Arg Arg Leu Glu 
245 250 255 

gac gag gag gac gat gat gcg acg cag aaa age ggc ggc tac gtt gac 816 
Asp Glu Glu Asp Asp Asp Ala Thr Gin Lys Ser Gly Gly Tyr Val Asp 
260 265 270 

egg ttt etc tec tct ggc ttg tac tag 843 
Arg Phe Leu Ser Ser Gly Leu Tyr 
275 280 



<210> 2 

<211> 280 

<212> PRT 

<213> Hansenula polymorpha 

<400> 2 

Met Asp Phe Met Lys Pro Glu Thr Val Leu Asp Leu Gly Asn He Arg 
1 5 10 15 

Asp Ala Leu Val Arg Met Glu Asp Thr He He Phe Asn Phe He Glu 
20 25 30 



Arg Ser Gin Phe Tyr Ala Ser Pro Ser Val Tyr Lys Val Asn Gin Phe 
35 40 45 



Pro lie Pro Asn Phe Asp Gly Ser Phe Leu Asp Trp Leu Leu Ser Gin 
50 55 60 



His Glu Arg lie His Ser Gin Val Arg Arg Tyr Asp Ala Pro Asp Glu 
65 70 75 80 



Val Pro Phe Phe Pro Asn Val Leu Glu Lys Thr Phe Leu Pro Lys lie 
85 90 95 



Asn Tyr Pro Ser Val Leu Ala Ser Tyr Ala Asp Glu He Asn Val Asn 
100 105 110 



Lys Glu He Leu Lys He Tyr Thr Ser Glu He Val Pro Gly He Ala 
115 120 125 



Ala Gly Ser Gly Glu Gin Glu Asp Asn Leu Gly Ser Cys Ala Met Ala 
130 135 140 



Asp He Glu Cys Leu Gin Ser Leu Ser Arg Arg He His Phe Gly Arg 
145 150 155 160 



Phe Val Ala Glu Ala Lys Phe He Ser Glu Gly Asp Lys He Val Asp 
165 170 J 175 



Leu He Lys Lys Arg Asp Val Glu Gly He Glu Ala Leu lie Thr Asn 
180 185 190 



Ala Glu Val Glu Lys Arg He Leu Asp Arg Leu Leu Glu Lys Gly Arg 
195 200 205 



Ala Tyr Gly Thr Asp Pro Thr Leu Lys Phe Thr Gin His He Gin Ser 
210 215 ~ 220 



Lys Val Lys Pro Glu Val He Val Lys He Tyr Lys Asp Phe Val He 
225 230 235 240 



Pro Leu Thr Lys Lys Val Glu Val Asp Tyr Leu Leu Arg Arg Leu Glu 
245 250 255 



Asp Glu Glu Asp Asp Asp Ala Thr Gin Lys Ser Gly Gly Tyr Val Asp 
260 265 * 270 



Arg Phe Leu Ser Ser Gly Leu Tyr 
275 280 



<210> 3 
<211> 1655 



<212> DNA 

<213> Hansenula polymorpha 
<220> 

<221> gene 

<222> (1)..(1655) 

<223> 1,8 kb genomic DNA-fragment from Hanseula polymorpha 

<220> 

<221> gene 

<222> (1) . . (1655) 

<223> 1,8 kb genomic DNA-fragment from Hansenula polymorpha 

<400> 3 



cccggcccaa 


tgccagcaat 


atggagacgt 


ttaggcagaa 


taggcgttcc 


atacttctca 


60 


cgctgcttgt 


tgccaccgga 


atatacaccg 


cattgcagtt 


tgcacacatc 


atactatatg 


120 


acgattacat 


tggcggaacg 


tatcgcgagt 


cgctcacgag 


acgcattaga 


atgacagaga 


180 


aatcgcgaaa 


cgaccttata 


gacgcacgtg 


aaaactacgg 


gtttggaggc 


agcaaggagg 


240 


agcgaatcca 


gcggtttttg 


tggttcagac 


atctttcgtg 


gcttttaggc 


gaggataagc 


300 


gaacttgagg 


agcgtttttt 


ttttcctgtt 


tagtttttgt 


aggtatggac 


tttatgaagc 


360 


cagaaacagt 


gctggacctt 


ggcaacatta 


gagatgcctt 


ggtccggatg 


gaggatacga 


420 


tcatcttcaa 


ctttatcgag 


cggtcgcagt 


tctatgcgtc 


gccctcggta 


tacaaagtca 


480 


accagttccc 


tattcccaac 


ttcgacggct 


cgttcttgga 


ctggctgttg 


tcgcagcacg 


540 


agcgaatcca 


ttcgcaggtg 


aggagatacg 


acgcgccaga 


cgaggtgcct 


tttttcccca 


600 


acgtgctgga 


aaaaacgttt 


ctgcccaaga 


tcaactaccc 


atcggtgcta 


gcctcctacg 


660 


cggatgaaat 


caacgtcaac 


aaagagatac 


tcaagatcta 


cacgtcagag 


atagtaccag 


720 


gaatagctgc 


aggcagcgga 


gagcaggagg 


acaaccttgg 


ctcgtgcgca 


atggccgaca 


780 


tcgagtgcct 


gcagtcgcta 


tccagaagaa 


tccattttgg 


ccgttttgtc 


gcagaggcta 


840 


aatttatcag 


tgagggggac 


aagattgtgg 


atctgatcaa 


aaagagagat 


gtggaaggca 


900 


ttgaggcgct 


catcacaaac 


gccgaggtcg 


aaaaacggat 


cttggacaga 


cttctggaga 


960 


agggaagggc 


gtatggaaca 


gacccgacac 


taaagttcac 


gcagcacatt 


cagagcaagg 


1020 


tgaagcccga ggtgattgtg 


aaaatctaca 


aggatttcgt 


gattccgctc 


acgaagaagg 


1080 


tcgaagtcga 


ctacttgctg 


agacggctgg 


aggacgagga 


ggacgatgat 


gcgacgcaga 


1140 


aaagcggcgg 


ctacgttgac 


cggtttctct 


cctctggctt 


gtactagaaa 


ttaaaatttt 


1200 


cagtacttta 


attattctcg 


aattctagtt 


cagataccgc 


atggtaattt 


caaaggccag 


1260 


aaaagtggcc 


gcgttggctg 


gggcagctct 


cagaatagtc 


ggcgagaatc 


ctttgactag 


1320 


cccccaggca 


ccgctctgtc 


tccaaatacc 


cctaatagtc 


tcaacagcat 


ttctataaac 


1380 


cagcttcttg 


tagttgtccg 


tctgcatgtt 


ggacttgatc 


acatcgatcg 


gataaatact 


1440 




gaaccacatc ccgtaacctg ccagcgcccc aaagacgcag agcttccagt tctcgatgtc 1500 

cttcctggca atattccgcg actcgatctc gtttttcacg agagcttcaa aagtcagaaa 1560 

atacgctccg ctacccaaac tttctcttgc cagcgtaggt cccagacccc ggtagattaa 1620 

cttgatgcct cccgtatggt acagcttctt gatcc 1655 



<210> 


4 


<211> 


20 


<212> 


DNA 



<213> Artificial 
<220> 

<22 3> Oligonucleotide 

<400> 4 



aattaaccct 


<210> 


5 


<211> 


22 


<212> 


DNA 



20 



<213> Artificial 
<220> 

<22 3> Oligonucleotide 

<400> 5 

gtaatacgac tcactatagg gc 22 

<210> 6 

<211> 26 

<212> DNA 

<213> Artificial 



<220> 

<223> Oligonucleotide 
<400> 6 

atatagatct acaaaaacta aacagg 26 

<210> 7 



<211> 28 

<212> DNA 

<213> Artificial 

<220> 

<223> Oligonucleotide 



<400> 7 

atatagatct gatgcgacgc agaaaagc 



28 



